Interconnection network organization and its impact on performance and cost in shared memory multiprocessors
Authors
Abstract
Interconnection network organization has a significant impact on multiprocessor system performance. However, detailed and comprehensive studies of the performance and the effect of different organizations are virtually non-existent. In this study, three interconnection network organizations are evaluated as part of a cache-based shared memory multiprocessor system: tori, multistage networks, and single-stage shuffle-exchange networks. System size is limited to 256 or fewer processors, a range where wiring constraints can be largely ignored. The performance impact of topology choice, switch size, and channel width is studied under three different constraints: fixed switch size and channel width, constant number of switch pins, and constant network cost. The cost model reflects switch size and channel width. We find that after a certain point the performance advantage of wider channels becomes small, and the network topology and switch size become the determining parameters. Our results show that the multistage network is the best network topology if cost is not the main limiting factor. Otherwise, the single-stage network is the most cost-effective network topology. 2-D torus networks are seriously limited in terms of performance and cost. The dimension of torus networks needs to be larger than 2 in order to have reasonable relative performance vis-à-vis networks of other topologies.
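As a rough illustration of the topologies named in the abstract, the following Python sketch computes shuffle-exchange neighbors and the hop-count diameter of a k-ary n-dimensional torus. It is not the paper's simulation or cost model: the function names are hypothetical, links are assumed bidirectional, and only hop counts are considered (no channel width, switch size, or contention).

```python
def shuffle(node: int, n_bits: int) -> int:
    """Perfect shuffle: rotate the n_bits-bit node address left by one bit."""
    mask = (1 << n_bits) - 1
    return ((node << 1) | (node >> (n_bits - 1))) & mask


def exchange(node: int) -> int:
    """Exchange link: flip the least significant address bit."""
    return node ^ 1


def torus_neighbors(coords: tuple, k: int) -> list:
    """Neighbors of a node in a k-ary torus (wraparound in every dimension)."""
    neighbors = []
    for dim in range(len(coords)):
        for step in (-1, 1):
            nxt = list(coords)
            nxt[dim] = (nxt[dim] + step) % k
            neighbors.append(tuple(nxt))
    return neighbors


def torus_diameter(k: int, n: int) -> int:
    """Hop-count diameter of a k-ary n-dimensional torus with bidirectional links."""
    return n * (k // 2)


# Two ways to arrange 256 nodes as a torus (assumed configurations).
diameter_2d = torus_diameter(16, 2)  # 16 x 16
diameter_4d = torus_diameter(4, 4)   # 4 x 4 x 4 x 4
```

Under these assumptions, a 256-node 2-D torus has a diameter of 16 hops while a 4-D torus of the same size has 8, which is consistent in spirit with the abstract's remark that torus dimension needs to exceed 2, though hop count alone does not reproduce the paper's results.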
Related articles
Reducing Run Queue Contention in Shared Memory Multiprocessors
No single method for mitigating the performance problems of centralized and distributed run queues is entirely successful. A hierarchical run queue succeeds by borrowing the best features of both. Performance of parallel processing systems, especially large systems, is sensitive to various types of overhead and contention. Performance consequences may be serious when contention occurs ...
Modeling and Performance Evaluation of Multi-Processors Organization with Shared Memories
This paper is primarily concerned with the theoretical evaluation of the performance of multiprocessor systems. A Markovian waiting-line model has been developed for various multiprocessor configurations with shared memory. The system is analysed at the request level rather than the job level.
A Scaleable Multiprocessor Architecture with Multiple Read-Write Memory Model
This paper presents a scalable multiprocessor architecture with multiple access memories and multi-way busses. This parallel architecture, with a more intelligent memory model and an efficient multi-way interconnection network organization, is called the CRrCW (Concurrent Read and restricted Concurrent Write) scaleable multiprocessor system. The memory and network model provides concurrent memory acces...
Crosspoint Cache Architectures
We propose a new architecture for shared memory multiprocessors, the crosspoint cache architecture. This architecture consists of a crossbar interconnection network with a cache memory at each crosspoint switch. It assures cache coherence in hardware while avoiding the performance bottlenecks associated with previous hardware cache coherence solutions. We show this architecture is feasible for ...
Locality-Based Scheduling for Shared-Memory Multiprocessors
The last decade has produced enormous improvements in microprocessor performance without a corresponding improvement in memory or interconnection network performance. As a result, the relative cost of communication in shared-memory multiprocessors has increased dramatically. Although many applications could ignore the cost of communication and still achieve good performance on the previous gene...
Journal title:
- Parallel Computing
Volume 25, Issue
Pages -
Publication date: 1999